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Claims 

What is claimed is: 

1 . A method of optimized costing, the method comprising: 

5 identifying a join that identifies a first table and a second table, the first table comprising one or 

more unique first table values, the second table comprising one or more unique second table 
values; 

determining an optimized cost of reading the first table, the optimized cost of reading the first 
table comprising: 

10 if the number of unique first table values is greater than the number of unique second 

table values, 

returning the number of instances where a unique first table value matches a 
unique second table value ; 

otherwise, 

15 returning the number of unique first table values; 

determining an optimized cost of reading the second table, the optimized cost of reading the 

second table comprising the number of unique second table values; and 
summing the optimized cost of reading the first table and the optimized cost of reading the 

second table. 

20 

2. The method of Claim 1, further comprising multiplying the optimized cost of reading the first 
table by a multiplier if the number of unique first table values is greater than the number of unique 
second table values. 

25 3. The method of Claim 2, wherein the multiplier is the number of unique second table values 
divided by the number of unique first table values. 

4. The method of Claim 1, wherein determining the optimized cost of reading the second table 
further comprises performing unique sorting on the right table. 

30 
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5 . The method of Claim 1 , further comprising: 

determining a maximum cost associated with the join, the maximum cost comprising the sum 
of an unoptimized cost of reading the first table and an unoptimized cost of reading the 
second table; 

5 comparing the maximum cost associated with the join to the sum of the optimized cost of 

reading the first table and the optimized cost of reading the second table; and 
returning the maximum cost if the maximum cost is less than the sum of the optimized cost of 
reading the first table and the optimized cost of reading the second table. 



10 6. The method of Claim 1, wherein determining an optimized cost of reading the first table further 
comprises: 

assigning a confidence level to the first table; and 
assigning a confidence level to the second table. 



15 7. A method of optimized costing, the method comprising: 

identifying a join that identifies a first table and a second table, the first table comprising one or 
more unique first table values, the second table comprising one or more unique second table 
values; 

removing one or more duplicate instances of each of the one or more unique second table 
20 values to determine an optimized cost of reading the second table; 

exiting the join after each of the first unique table values is matched to a second unique table 

value to determine an optimized cost of reading the first table; and 
summing the optimized cost of reading the first table and the optimized cost of reading the 
second table. 

25 

8. The method of Claim 7, further comprising: 

determining whether the number of unique first table values is greater than the number of unique 
second table values; 

if the number of unique first table values is greater than the number of unique second table 
30 values, 

multiplying the optimized cost of reading the first table by a multiplier; 
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otherwise, 

returning the number of unique first table values. 

9. The method of Claim 8, wherein the multiplier is the number of unique second table values 
divided by the number of unique first table values. 

10. The method of Claim 7, wherein removing the one or more duplicate instances of each of the one 
or more unique second table values comprises: 

performing unique sorting on the right table; and 
returning the number of unique second table values. 

1 1 . The method of Claim 7, further comprising: 

determining a maximum cost associated with the join, the maximum cost comprising the sum of 
an unoptimized cost of reading the first table and an unoptimized cost of reading the second 
table; 

comparing the maximum cost associated with the join with the sum of the optimized cost of 
reading the first table and the optimized cost of reading the second table; and 

returning the maximum cost if the maximum cost is less than the sum of the optimized cost of 
reading the first table and the optimized cost of reading the second table. 

12. The method of Claim 7, wherein exiting the join after each of the first unique table values is 
matched to a second unique table value further comprises: 

assigning a confidence level to the first table; and 
assigning a confidence level to the second table. 

13. A database system for accessing a database, the database system including: 
a massively parallel processing system including 

one or more nodes; 

a plurality of CPUs, each of the one or more nodes providing access to one or more 
CPUs; 

a plurality of virtual processes each of the one or more CPUs providing access to one or 
more processes; and 
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each process configured to manage data stored in one of a plurality of data-storage 
facilities; 

an optimizer for optimizing costing, the optimizer including: 

a process for identifying a join that identifies a first table and a second table, the first 
5 table comprising one or more unique first table values, the second table comprising 

one or more unique second table values; 
a process for determining an optimized cost of reading the first table, the optimized cost 
of reading the first table comprising: 

if the number of unique first table values is greater than the number of unique 
10 second table values, 

a process for returning the number of instances where a unique first table 
value matches a unique second table value ; 

otherwise, 

a process for returning the number of unique first table values; 
15 a process for determining an optimized cost of reading the second table, the optimized 

cost of reading the second table comprising the number of unique second table 
values; and 

a process for summing the optimized cost of reading the first table and the optimized 
cost of reading the second table. 

20 

14. The database system of Claim 13, wherein the optimizer further includes a process for 
multiplying the optimized cost of reading the first table by a multiplier if the number of unique first 
table values is greater than the number of unique second table values. 

25 15. The database system of Claim 14, wherein the multiplier is the number of unique second table 
values divided by the number of unique first table values. 

16. The database system of Claim 13, wherein the process for determining the optimized cost of 
reading the second table further comprises a process for performing unique sorting on the right table. 

30 
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17. The database system of Claim 13, wherein the optimizer further includes: 

a process for determining a maximum cost associated with the join, the maximum cost 

comprising the sum of an unoptimized cost of reading the first table and an unoptimized 

cost of reading the second table; 
a process for comparing the maximum cost associated with the join to the sum of the optimized 

cost of reading the first table and the optimized cost of reading the second table; and 
a process for returning the maximum cost if the maximum cost is less than the sum of the 

optimized cost of reading the first table and the optimized cost of reading the second table. 

18. The database system of Claim 13, wherein the process for determining an optimized cost of 
reading the first table further comprises: 

assigning a confidence level to the first table; and 
assigning a confidence level to the second table. 

19. A database system for accessing a database, the database system including: 
a massively parallel processing system including 

one or more nodes; 

a plurality of CPUs, each of the one or more nodes providing access to one or more 
CPUs; 

a plurality of virtual processes each of the one or more CPUs providing access to one or 
more processes; and 

each process configured to manage data stored in one of a plurality of data-storage 
facilities; 

an optimizer for optimizing costing, the optimizer including: 

a process for identifying a join that identifies a first table and a second table, the first 

table comprising one or more unique first table values, the second table comprising 

one or more unique second table values; 
a process for removing one or more duplicate instances of each of the one or more 

unique second table values to determine an optimized cost of reading the second 

table; 

a process for exiting the join after each of the first unique table values is matched to a 
second unique table value to determine an optimized cost of reading the first table; 
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and 

a process for summing the optimized cost of reading the first table and the optimized 
cost of reading the second table. 

20. The database system of Claim 19, wherein the optimizer further includes: 

a process for determining whether the number of unique first table values is greater than the 

number of unique second table values; 
if the number of unique first table values is greater than the number of unique second table 

values, 

a process for multiplying the optimized cost of reading the first table by a multiplier; 
otherwise, 

a process for returning the number of unique first table values. 

21. The database system of Claim 20, wherein the multiplier is the number of unique second table 
values divided by the number of unique first table values. 

22. The database system of Claim 19, wherein the process for removing the one or more duplicate 
instances of each of the one or more unique second table values comprises: 

performing unique sorting on the right table; and 
returning the number of unique second table values. 

23. The database system of Claim 19, wherein the optimizer further includes: 

a process for determining a maximum cost associated with the join, the maximum cost 

comprising the sum of an unoptimized cost of reading the first table and an unoptimized 

cost of reading the second table; 
a process for comparing the maximum cost associated with the join with the sum of the 

optimized cost of reading the first table and the optimized cost of reading the second table; 

and 

a process for returning the maximum cost if the maximum cost is less than the sum of the 
optimized cost of reading the first table and the optimized cost of reading the second table. 
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24. The database system of Claim 19, wherein the process for exiting the join after each of the first 
unique table values is matched to a second unique table value further comprises: 

assigning a confidence level to the first table; and 
assigning a confidence level to the second table. 

5 

25. A computer program, stored on a tangible storage medium, for optimizing costing, the program 
including executable instructions that cause a computer to: 

identify a join that identifies a first table and a second table, the first table comprising one or 
more unique first table values, the second table comprising one or more unique second table 
10 values; 

determine an optimized cost of reading the first table, the optimized cost of reading the first 
table comprising: 

if the number of unique first table values is greater than the number of unique second 
table values, 

15 returning the number of instances where a unique first table value matches a 

unique second table value; 

otherwise, 

returning the number of unique first table values; 
determine an optimized cost of reading the second table, the optimized cost of reading the 
20 second table comprising the number of unique second table values; and 

sum the optimized cost of reading the first table and the optimized cost of reading the second 
table. 

26. The computer program of Claim 25 including executable instructions that further cause a 
25 computer to multiply the optimized cost of reading the first table by a multiplier if the number of 

unique first table values is greater than the number of unique second table values. 

27. The computer program of Claim 26, wherein the multiplier is the number of unique second table 
values divided by the number of unique first table values. 

30 
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28. The computer program of Claim 25, where when determining the optimized cost of reading the 
second table, the computer: 

performs unique sorting on the right table. 

5 29. The computer program of Claim 25 including executable instructions that further cause a 
computer to: 

determine a maximum cost associated with the join, the maximum cost comprising the sum of 
an unoptimized cost of reading the first table and an unoptimized cost of reading the second 
table; 

10 compare the maximum cost associated with the join to the sum of the optimized cost of reading 

the first table and the optimized cost of reading the second table; and 
return the maximum cost if the maximum cost is less than the sum of the optimized cost of 
reading the first table and the optimized cost of reading the second table. 

15 30. The computer program of Claim 25, where when determining an optimized cost of reading the 
first table, the computer: 

assigns a confidence level to the first table; and 
assigns a confidence level to the second table. 

20 31. A computer program, stored on a tangible storage medium, for optimizing costing, the program 
including executable instructions that cause a computer to: 

identify a join that identifies a first table and a second table, the first table comprising one or 
more unique first table values, the second table comprising one or more unique second table 
values; 

25 remove one or more duplicate instances of each of the one or more unique second table values 

to determine an optimized cost of reading the second table; 
exit the join after each of the first unique table values is matched to a second unique table value 

to determine an optimized cost of reading the first table; and 
sum the optimized cost of reading the first table and the optimized cost of reading the second 
30 table. 
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32. The computer program of Claim 31, including executable instructions that further cause a 
computer to: 

determine whether the number of unique first table values is greater than the number of unique 
second table values; 

5 if the number of unique first table values is greater than the number of unique second table 

values, 

multiply the optimized cost of reading the first table by a multiplier; 
otherwise, 

return the number of unique first table values. 

10 

33. The computer program of Claim 32, wherein the multiplier is the number of unique second table 
values divided by the number of unique first table values. 



34. The computer program of Claim 31, where when removing the one or more duplicate instances 
15 of each of the one or more unique second table values, the computer: 
performs unique sorting on the right table; and 
returns the number of unique second table values. 



35. The computer program of Claim 31, including executable instructions that further cause a 
20 computer to: 

determine a maximum cost associated with the join, the maximum cost comprising the sum of an 
unoptimized cost of reading the first table and an unoptimized cost of reading the second 
table; 

compare the maximum cost associated with the join with the sum of the optimized cost of 
25 reading the first table and the optimized cost of reading the second table; and 

return the maximum cost if the maximum cost is less than the sum of the optimized cost of 
reading the first table and the optimized cost of reading the second table. 



36. The computer program of Claim 31, where when exiting the join after each of the first unique 
30 table values is matched to a second unique table value, the computer: 
assigns a confidence level to the first table; and 
assigns a confidence level to the second table. 
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